Towards Adaptive Multi-Robot Coordination Based on Resource Expenditure Velocity: Extended Version

نویسندگان

Dan Erusalimchik

Gal A. Kaminka

چکیده

In the research area of multi-robot systems, several researchers have reported on consistent success in using heuristic measures to improve loose coordination in teams, by minimizing coordination costs using various heuristic techniques. While these heuristic methods has proven successful in several domains, they have never been formalized, nor have they been put in context of existing work on adaptation and learning. As a result, the conditions for their use remain unknown. We posit that in fact all of these different heuristic methods are instances of reinforcement learning in a one-stage MDP game, with the specific heuristic functions used as rewards. We show that a specific reward function—which we call Effectiveness Index (EI)—is an appropriate reward function for learning to select between coordination methods. EI estimates the resource-spending velocity by a coordination algorithm, and allows minimization of this velocity using familiar reinforcement learning algorithms (in our case, Q-learning in one-stage MDP). The paper analytically and empirically argues for the use of EI by proving that under certain conditions, maximizing this reward leads to greater utility in the task. We report on initial experiments that demonstrate that EI indeed overcomes limitations in previous work, and outperforms it in different cases.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive Multi - Robot Coordination Based on Resource Spending Velocity ( Extended

متن کامل

Towards Adaptive Multi-Robot Coordination Based on Resource Expenditure Velocity

متن کامل

Adaptive multi-robot coordination based on resource spending velocity

متن کامل

Design of an Adaptive Fuzzy Estimator for Force/Position Tracking in Robot Manipulators

This paper presents a stable new algorithm for force/position control in robot manipulators. In this algorithm, position vectors are measured by sensors and then used in the control law. Since using force sensor has some issues such as high costs and technical problems, an approach is presented to overcome these issues. In this respect, force sensor is replaced by an adaptive fuzzy estimator to...

متن کامل

Towards a Probabilistic Roadmap for Multi-robot Coordination

In this paper, we discuss the problem of multirobot coordination and propose an approach for coordinated multi-robot motion planning by using a probabilistic roadmap (PRM) based on adaptive cross sampling (ACS). The proposed approach, called ACS-PRM, is a samplingbased method and consists of three steps including Cspace sampling, roadmap building and motion planning. In contrast to previous app...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

Towards Adaptive Multi-Robot Coordination Based on Resource Expenditure Velocity: Extended Version

نویسندگان

چکیده

منابع مشابه

Adaptive Multi - Robot Coordination Based on Resource Spending Velocity ( Extended

Towards Adaptive Multi-Robot Coordination Based on Resource Expenditure Velocity

Adaptive multi-robot coordination based on resource spending velocity

Design of an Adaptive Fuzzy Estimator for Force/Position Tracking in Robot Manipulators

Towards a Probabilistic Roadmap for Multi-robot Coordination

عنوان ژورنال:

اشتراک گذاری